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Battery  cell  total  capacity  refers  to  the  total  amount  of  charge  that  can  be  extracted  from  a  fully  charged 
cell.  Knowledge  of  the  present  total  capacity  value  is  important  to  being  able  to  calculate  the  maximum 
energy  storage  capability  of  a  battery  pack,  the  remaining  energy  in  a  battery  pack,  and  as  an  indicator  of 
the  battery’s  state  of  health.  We  show  that  traditional  methods  of  estimating  battery  cell  total  capacity, 
which  consider  noises  only  in  the  accumulated  ampere  hour  measurement,  are  biased.  Battery  cell  total 
capacity  must  be  estimated  with  knowledge  of  both  the  noises  on  the  state  of  charge  estimates  and  on  the 
accumulated  ampere  hour  measurements  used  to  compute  the  total  capacity  estimate.  We  demonstrate 
how  total  least  squares  gives  better  results  than  traditional  methods,  and  derive  an  approximate  weighted 
total  least  squares  algorithm  that  is  suitable  for  implementation  in  an  embedded  battery  management 
system. 
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1.  Introduction 

The  total  capacity  of  a  battery  cell  is  a  value,  usually  expressed  in 
ampere  hours  (Ah)  or  milliampere  hours  (mAh),  that  indicates  the 
maximum  electrical  charge  that  the  battery  cell  is  capable  of  hold¬ 
ing.  New  battery  cells  are  manufactured  with  certain  nominal  total 
capacities,  but  as  the  cells  age,  their  capacities  generally  decrease. 
Therefore,  being  able  to  accurately  estimate  the  total  capacity  of  a 
battery  cell  is  important  to  being  able  to  determine  the  health  of 
that  battery  cell,  the  maximum  energy  that  can  be  stored  in  that  cell, 
and  (with  the  additional  knowledge  of  present  state-of-charge),  the 
present  energy  stored  in  the  cell. 

The  state-of-charge  (SOC)  of  a  battery  cell  is  a  value  between  0% 
and  100%  that  indicates  the  relative  level  of  charge  presently  held 
by  the  battery  cell.  A  state-of-charge  of  1 00%  corresponds  to  a  “full” 
cell,  while  a  state-of-charge  of  0%  corresponds  to  an  “empty”  cell. 
State-of-charge  is  sometimes  referred  to  as  “residual  capacity”  and 
is  not  to  be  confused  with  the  battery  cell  total  capacity.  However, 
the  two  are  related  by  the  equation 

z(t2)  =  z(£i)  +  -!-  [  dr.  (1) 
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where  z(t2 )  is  the  battery  cell  SOC  at  time  t2,z(t\ )  is  the  battery  cell 
SOC  at  time  tt,  Q is  the  battery  cell  total  capacity  in  ampere-hours, 
i(t)  is  the  battery  cell  current  at  time  t  in  amperes,  r]  is  a  unitless 
efficiency  factor,  which  may  take  on  different  values  depending  on 
whether  the  current  is  positive  or  negative,  and  time  is  measured  in 
seconds.  The  factor  of  3600  converts  seconds  to  hours.  SOC  itself  is 
unitless.  Note  that  in  the  convention  used  herein,  discharge  current 
is  assumed  to  have  negative  sign  and  charge  current  is  assumed  to 
have  positive  sign.  Note  also  that  in  this  work  we  treat  total  capacity 
as  an  electrochemical  property  of  the  cell  that  is  independent  from 
both  temperature  and  rate.  (We  define  total  capacity  precisely  in 
Section  2.) 

Eq.  (1)  is  the  mathematical  basis  for  most  capacity  estimation 
methods.  We  can  rearrange  its  terms  to  get: 

(2) 

' - V - '  * 

y 

where  the  obvious  linear  structure  of  y  =  Qx  becomes  apparent. 
Using  a  regression  technique,  for  example,  one  may  compute  esti¬ 
mates  of  Q.  One  needs  only  to  find  values  for  “x”  and  “y”. 

The  problem  with  using  standard  (least  squares)  linear  regres¬ 
sion  techniques  is  that  both  the  integrated  current  value  y  and  the 
difference  between  state-of-charge  values  x  have  sensor  noise  or 
estimation  noise  associated  with  them.  The  least  squares  linear 
regression  problem  is  a  solution  to  the  equation  (y  -  Ay)  =  Qx;  that 
is,  there  is  noise  assumed  on  the  measurements  y,  but  not  on  the 
independent  variable  x.  However,  Eq.  (2)  is  implicitly  of  the  form 
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(y  -  Ay)  =  Q(x  -  Ax)  since  both  the  integrated  current  and  SOC  esti¬ 
mates  have  noise.  That  is,  because  estimates  of  SOC  are  generally 
imperfect,  there  will  be  noise  on  the  x  variable,  and  using  standard 
least  squares  linear  regression  results  in  an  inaccurate  and  biased 
estimate  of  battery  cell  total  capacity. 

The  usual  approach  to  counteract  this  problem  is  to  try  to  ensure 
that  the  SOC  estimates  are  as  accurate  as  possible  and  then  use 
standard  least  squares  estimation  anyway.  For  example,  [1]  puts 
constraints  on  how  the  capacity  is  estimated.  It  forces  the  cell  cur¬ 
rent  to  be  zero  before  the  test  begins  (so  that  the  cell  is  in  an 
equilibrium  state  and  the  first  SOC  estimate  is  as  accurate  as  pos¬ 
sible)  and  similarly  forces  the  cell  current  to  be  zero  after  the  test 
ends  (again,  so  that  the  cell  is  in  an  equilibrium  state  and  the  sec¬ 
ond  SOC  estimate  is  accurate).  This  procedure  eliminates  to  a  large 
extent  (but  not  completely)  the  error  in  the  x  variable,  and  makes 
the  regression  reasonably  accurate.  A  second  example  is  [8],  where 
battery  current  is  periodically  interrupted  and  a  test  performed  to 
estimate  the  cell  resistance  so  that  the  ongoing  SOC  estimates  may 
be  corrected  for  ohmic  voltage  losses.  Again,  the  basic  idea  is  to  try 
to  make  x  as  noise-free  as  possible. 

Both  of  these  methods  require  a  very  structured  and  invasive 
setting  for  determining  battery  cell  total  capacity.  The  battery  cell 
current  must  be  controlled  by  the  algorithm.  Furthermore,  neither 
correctly  handles  the  residual  noise  in  x:  while  they  minimize  the 
noise,  they  never  totally  eliminate  it. 

In  this  paper,  we  propose  a  method  for  determining  battery  cell 
total  capacity  in  a  non-invasive  setting  where  the  method  does  not 
impose  constraints  on  the  battery  cell  current,  and  where  noise 
in  the  SOC  estimates  is  correctly  accounted  for.  That  is,  the  algo¬ 
rithm  optimally  uses  the  noisy  x  and  y  data  to  compute  a  total 
capacity  estimate  that  is  unbiased  by  the  estimation  and  measure¬ 
ment  noises  in  x  andy.  This  method  is  based  on  total  least  squares 
methods  [2,4-7, 1 5,1 6],  rather  than  on  ordinary  least  squares  meth¬ 
ods.  Several  variations  are  derived,  with  the  final  version  able  to 
determine  an  accurate  estimated  battery  cell  total  capacity  in  a 
computationally  efficient  manner,  without  imposing  constraints  on 
the  battery  cell  current. 

It  is  not  the  intent  of  this  paper  to  propose  an  SOC  estima¬ 
tion  algorithm.  (For  a  survey  of  some  available  methods,  see  [9].) 
Indeed  the  total  capacity  estimation  techniques  proposed  herein 
are  agnostic  -  to  a  point  -  to  the  SOC  estimation  technique  used  to 
give  the  input  SOC  estimates  that  produce  x.  However,  some  SOC 
estimation  methods  may  not  be  appropriate  because  they  have  a 
strong  dependence  on  the  total  capacity  estimate,  leading  to  a  cir¬ 
cularity  of  dependencies.  For  example,  Coulomb  counting  may  not 
be  used  because  it  must  have  an  accurate  estimate  of  total  capacity 
to  give  an  accurate  estimate  of  SOC.  Using  Coulomb  counting  with 
the  methods  presented  herein  will  give  a  circular  dependence  that 
will  not  converge  and  will  eventually  go  unstable.  SOC  estimation 
methods  based  only  on  voltage  are  appropriate  although,  arguably, 
their  accuracy  tends  to  be  very  poor,  especially  with  cells  having  a 
very  flat  open  circuit  voltage  characteristic  (e.g.,  lithium-ion  cells 
with  an  iron  phosphate  cathode  chemistry,  “LFP”  cells).  Our  own 
preference  is  to  use  Kalman  filter  based  methods,  and  particularly 
sigma-point  Kalman  filters  (SPKF),  which  optimally  combine  volt¬ 
age  and  current  information  and  are  very  insensitive  to  errors  in 
the  value  used  for  total  capacity  when  making  SOC  estimates.  This 
allows  us  to  use  the  static  constant  Qnom  (the  nominal  capacity  of 
the  cell)  within  the  SPKF  when  computing  SOC  estimates  and  still 
have  accurate  results.  The  SOC  estimates  produced  by  SPKF  using 
the  static  value  Qnom  are  then  used  by  the  methods  proposed  herein 
to  produce  dynamic  estimates  Q  of  total  capacity  Q,  which  may 
be  used  for  state-of-health  determination.  Typical  SOC  estimation 
errors  using  SPKF  are  on  the  order  of  about  1%  for  lithium-ion  cells 
having  manganese  oxide  cathode  chemistries,  “LMO”  cells  [13,14]. 
In  unpublished  work,  we  have  also  validated  SPKF  for  LFP  cells, 


where  we  find  typical  errors  on  the  order  of  3-5%,  and  without 
requiring  reliance  on  a  precise  value  of  Qto  do  so. 

The  remainder  of  this  article  is  organized  as  follows:  Section  2 
defines  total  capacity  and  some  other  important  quantities.  Sec¬ 
tion  3  introduces  the  weighted  least  squares  and  weighted  total 
least  squares  methods,  as  well  a  method  to  evaluate  the  “goodness” 
of  a  total  capacity  estimation  methodology  overall,  and  a  method 
to  evaluate  the  dynamic  uncertainty  of  the  estimates  produced 
by  such  a  methodology.  Section  4  outlines  a  simplified  version 
of  the  weighted  total  least  squares  method  that  can  be  used  in 
some  scenarios,  and  is  the  motivation  for  the  recursive  approxi¬ 
mate  weighted  total  least  squares  method  proposed  in  Section  5. 
Simulation  results  to  demonstrate  the  features  and  limitations  of 
the  methods  are  presented  in  Section  6,  and  discussed  in  Section  7. 
Finally,  Section  8  summarizes  the  article’s  main  conclusions. 

2.  Defining  total  capacity 

We  now  introduce  some  definitions  in  order  to  carefully  define 
“total  capacity,”  with  the  purpose  of  differentiating  it  from  other 
terms  having  similar  names. 

Definition:  A  cell  is  fully  charged  when  its  open  circuit  voltage 
(OCV)  reaches  vh{T),  a  manufacturer  specified  voltage  that  may 
be  a  function  of  temperature  T.  (For  example,  typically  vh(25  °C)  = 
4.2  V  for  LMO  cells,  or  1^(25  °C)  =  3.6  V  for  LFP  cells.)  A  common 
method  to  bring  a  cell  to  a  fully  charged  state  is  to  execute  a 
constant-current  charge  profile  until  the  terminal  voltage  is  equal 
to  vh(T ),  followed  by  a  constant-voltage  profile  until  the  charg¬ 
ing  current  becomes  infinitesimal.  We  define  the  state-of-charge 
(SOC)  of  a  fully  charged  cell  to  be  100%. 

Definition:  A  cell  is  fully  discharged  when  its  OCV  reaches  vfT), 
a  manufacturer  specified  voltage  that  may  be  a  function  of  tem¬ 
perature  T.  (For  example,  typically  ^(25  °C)  =  3.0  V  for  LMO  cells, 
or  i//(25°C)  =  2.0  V  for  LFP  cells.)  A  cell  may  be  fully  discharged 
by  executing  a  constant-current  discharge  profile  until  its  termi¬ 
nal  voltage  is  equal  to  vfT),  followed  by  a  constant-voltage  profile 
until  the  discharge  current  becomes  infinitesimal.  We  define  the 
SOC  of  a  fully  discharged  cell  to  be  0%. 

Definition:  The  total  capacityQ  of  a  cell  is  the  quantity  of  charge 
removed  from  a  cell  as  it  is  brought  from  a  fully  charged  state  to  a 
fully  discharged  state.  While  the  SI  unit  for  charge  is  Coulombs,  it 
is  more  common  in  practice  to  use  units  of  ampere  hours  (Ah)  or 
milliampere  hours  (mAh)  to  measure  the  total  capacity  of  a  battery 
cell.  The  total  capacity  of  a  cell  is  not  a  fixed  quantity:  it  generally 
decays  slowly  over  time  as  the  cell  degrades. 

Definition:  The  discharge  capacityQ^rate^  of  a  cell  is  the  quantity 
of  charge  removed  from  a  cell  as  it  is  discharged  at  a  constant 
rate  from  a  fully  charged  state  until  its  loaded  terminal  voltage 
reaches  vfT).  Because  the  discharge  capacity  is  determined  based 
on  loaded  terminal  voltage  rather  than  open  circuit  voltage,  it  is 
strongly  dependent  on  the  cell’s  internal  resistance,  which  itself  is 
a  function  of  rate  and  temperature.  Hence,  the  discharge  capacity 
of  a  cell  is  rate  dependent  and  temperature  dependent.  Because 
of  the  resistive  IxR  drop,  the  discharge  capacity  is  less  than  the 
total  capacity  unless  the  discharge  rate  is  infinitesimal.  Likewise, 
the  SOC  of  the  cell  is  nonzero  when  the  terminal  voltage  reaches 
vfT)  at  a  non-infinitesimal  rate.  The  discharge  capacity  of  a  cell  at 
a  particular  rate  and  temperature  is  not  a  fixed  quantity:  it  also 
generally  decays  slowly  over  time  as  the  cell  degrades. 
Definition:  The  nominal  capacityQn0m  of  a  cell  is  a  manufacturer- 
specified  quantity  that  is  intended  to  be  representative  of  the 
1C- rate  discharge  capacity  Q1C  of  a  particular  manufactured  lot 
of  cells  at  room  temperature,  25  °C.  The  nominal  capacity  is  a  con¬ 
stant  value.  Since  the  nominal  capacity  is  representative  of  a  lot  of 
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cells  and  the  discharge  capacity  is  representative  of  a  single  indi¬ 
vidual  cell,  Qnom  ^  Qic  in  general,  even  at  beginning  of  life.  Also, 
since  Qnom  is  representative  of  a  discharge  capacity  and  not  a  total 
capacity,  Qnom  A  Q- 


Total  capacity  is  an  intrinsic  property  of  a  cell’s  electrode  mate¬ 
rials,  the  volume  of  electrode  active  materials,  and  the  design 
electrode  state-of-charge  ranges.  (Note  that  an  electrode’s  state-of- 
charge  is  a  distinct  quantity  from  the  cell  state-of-charge,  although 
the  two  are  related.  The  electrode  state-of-charge  is  the  value  of  v 
in  Lii,C6  in  a  graphite  anode,  the  value  of  w  in  LiwMn204  in  an  LMO 
cathode,  or  the  value  of  w  in  LiwFeP04  in  an  LFP  cathode,  for  exam¬ 
ple.)  Total  capacity  is  equal  to  the  count  of  vacant  positions  in  the 
cathode  lattice  structure  when  the  cell  is  fully  charged  that  would 
be  filled  with  lithium  ions  when  fully  discharged.  Equivalently,  it 
is  equal  to  the  count  of  vacancies  in  the  anode  lattice  structure 
when  the  cell  is  fully  discharged  that  would  be  filled  with  lithium 
ions  when  fully  charged.  Changing  temperature  does  not  change 
this  property,  nor  does  changing  the  rate  at  which  lithium  moves 
between  the  anode  and  cathode. 

The  purpose  of  this  paper  is  to  propose  an  optimal  method  to 
estimate  a  cell’s  total  capacity  Q.  We  will  assume  that  ij^  1  at  all 
values  of  current  and  temperature,  which  is  reasonably  accurate  for 
lithium-ion  type  cells  operated  according  to  manufacturer  specifi¬ 
cations,  where  self  discharge  and  side  reactions  causing  SEI  layer 
growth  and  lithium  plating  (and  so  forth)  can  be  neglected.  If  the 
cell  is  operated  outside  of  this  range,  then  ij  must  be  specifically 
modeled.  It  is  not  the  intention  of  this  paper  to  propose  a  method 
to  estimate  the  discharge  capacity  of  a  cell  at  a  particular  rate, 
although  this  can  be  computed  from  Qif  a  cell  model  is  known. 


3.  Weighted  least  squares  and  weighted  total  least  squares 

3.2.  Derivation  of  weighted  ordinary  least  squares 

Both  ordinary  least  squares  (OLS)  and  total  least  squares  (TLS), 
as  applied  to  battery  cell  total  capacity  estimation,  seek  to  find  a 
constant  Q  such  that  y  «  Qx  using  N-vectors  of  measured  data  x 
and  y.  The  i  th  element  xz  in  x  and  yz  in  y  correspond  to  data  col¬ 
lected  from  a  cell  over  an  interval  of  time,  where  xz  is  the  estimated 
change  in  state-of-charge  over  that  interval,  and  yz  is  the  accumu¬ 
lated  ampere  hours  passing  through  the  cell  during  that  period. 
Specifically, 


*i  = 

y*  = 


Z(tf  )  -  Z(tl  ) 

' ?  wW 


3600 


dr. 


for  time  interval  i 


The  vectors  x  and  y  must  be  at  least  one  sample  long  (N>  1),  but 
multiple  samples  may  be  used  to  obtain  better  total  capacity  esti¬ 
mates. 

The  OLS  approach  assumes  that  there  is  no  error  on  the  xz,  and 
models  the  data  as  y  =  Qx+  Ay,  where  Ay  is  a  vector  of  measure¬ 
ment  errors,  as  depicted  in  Fig.  1(a).  (The  error  bars  on  the  data  point 
are  meant  to  illustrate  the  uncertainties,  which  are  proportional  to 
<jyr)  Here,  we  assume  that  Ay  comprises  zero-mean  Gaussian  ran¬ 
dom  variables,  with  known  variances  a (which  are  not  necessarily 

equal  to  each  other).  OLS  attempts  to  find  an  estimate  Q  of  the  true 
cell  total  capacity  Q  that  minimizes  the  sum  of  squared  errors  Ay*. 
We  generalize  that  approach  here  slightly  to  allow  for  finding  a 
Q  that  minimizes  the  sum  of  weighted  squared  errors,  where  the 
weighting  takes  into  account  the  uncertainty  of  the  measurement. 
That  is,  we  desire  to  find  a  Q  that  minimizes  the  weighted  least 


Fig.  1.  Fitting  a  line  to  data  having  uncertainties:  (a)  WLS  fitting  to  data  with  errors 
ony,  only  and  (b)  WTLS  fitting  to  data  with  errors  on  both  x,  and  y,. 


squares  (WLS)  merit  function 

N  9  N  2 

„2  _  (y*  -  Yi )  _  (yz  -  Q*z) 

XWLS  -  2^  ,y2  ~  2^  a2 


(3) 


In  this  equation,  Yz  is  a  point  on  the  line  Y,  =  Qxz  corresponding  to 
the  measured  data  pair  (xz,  yz),  where  yz  is  assumed  to  have  noise 
but  Xj  has  no  noise. 

There  are  a  number  of  approaches  that  may  be  taken  to  solve 
this  problem,  but  one  that  will  serve  our  purposes  well  is  to  dif¬ 
ferentiate  Eq.  (3)  with  respect  to  Q  and  solve  for  Q  by  setting  the 
partial  derivative  to  zero. 


=  0 


1=1 

N 


If  we  define 


Xi 

Cl’"  =  2^’ 
i= i  ay< 


and 


_  _sr^yi 

2’n  ~l^n 2  ’ 


av- 
i=  1 


then  we  can  write  Qn  =  c2,n/ci,n-  The  two  quantities  C\  n  and  c2>n 
may  be  computed  recursively  to  minimize  storage  requirements 
and  to  even  out  computational  requirements  when  updating  Qn 
when  n  gets  large 


Cl,n  —  .n-1 

C2,n  =  ^2,n-l 


+ 


+ 


xi 


Gy 

yn 

Xnyn 


G 


yn 


The  recursive  approach  requires  an  initial  estimate  of  c\ >0  and  c2  0. 
One  approach  is  to  simply  set  cy 0  =  c2>0  =  0.  Alternately,  we  can  rec¬ 
ognize  that  a  cell  with  nominal  capacity  Qnom  has  that  capacity 
over  a  state-of-charge  range  of  1.0.  Therefore,  we  can  initialize 
with  a  synthetic  zeroth  “measurement”  where  x0  =  1  and  j/0  =  Qnom- 
The  value  for  can  be  set  to  the  manufacturing  variance  of  the 

nominal  capacity.  That  is,  c10  =  1  /ojQ  and  c2,o  =  Qnom/^. 

This  method  may  easily  be  adapted  to  allow  fading  memory  of 
past  measurements.  We  modify  the  WLS  merit  function  to  place 
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more  emphasis  on  recent  measurements.  We  define  the  fading 
memory  weighted  least  squares  (FMWLS)  merit  function  as 


N 


Xfmwls  ~ 


iCyi-Qxi)' 


z=i 


G 


yi 


(4) 


where  the  forgetting  factor  y  is  in  the  range  0<y<  1.  Then,  the 
solution  becomes 
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^/,-'(xiy,/cr2) 


Jfyn-Kx  f/tf.) 

1=1 

This  solution  may  also  easily  be  computed  in  a  recursive  manner. 
We  keep  track  of  the  two  running  sums  c1?n  =  Y^=  l  YN~lxj  I  °y[ 

C2,n  =  Z"=1  Ttlen>  0"  =  C2,n/ci,n-  When  an  additional 

data  point  becomes  available,  we  update  these  quantities  via 


Q,n  —  YQ,n-l  +  n 

Or, 

yn 

xnYn 

C2,n  =  YC2,n-!  + 
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yn 


This  yields 
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(x,--x,-r  l  cyi-y^ 
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We  set  the  partial  derivatives  9x^TLS,a/9xi  =  9X?vtls ,a/dYi  = 
9Xwtls  a/9^i  =  0-  This  gives  intermediate  result 


+  Qyrf. 


and  Y,  =  QX,. 


With  this  result,  we  can  re-write  Eq.  (6)  in  terms  of  known  quanti¬ 
ties  as 


N 

2  Cy 

^WTLS  ~  , 


Q*;)' 


02^-2  0-2 
i=l  ^  ^ 


(7) 


To  find  the  value  of  Q  that  minimizes  this  merit  function,  we  set 
the  partial  derivative  SXwm S/3Q.  =  0.  That  is, 

9/wtls  _  y ^2(Q-Xi  -yiXQy^Xj  +xi(Jyi)  _  Q  ^ 

9^  l&oit+Oyf 

Unfortunately,  this  solution  has  none  of  the  nice  properties  of  the 
WLS  solution.  Namely, 


In  summary,  the  WLS  and  FMWLS  solutions  have  a  number  of  nice 
properties: 


1.  They  give  a  closed-form  solution  for  Q.  No  iteration  or  advanced 
algorithms  are  required-only  simple  multiplication,  addition, 
and  division. 

2.  The  solutions  can  very  easily  be  computed  in  a  recursive  manner. 

3.  Fading  memory  can  easily  be  added  to  allow  the  estimate  Q  to 
place  greater  emphasis  on  more  recent  measurements  than  on 
earlier  measurements,  allowing  adaptation  of  Q  to  adjust  for  true 
cell  total  capacity  changes. 

3.2.  Derivation  of  weighted  total  least  squares 


1.  There  is  no  closed-form  solution  in  the  general  case;  a  numer¬ 
ical  method  must  be  used  instead  to  find  Q.  One  possibility  is 
to  perform  a  Newton-Raphson  search  for  Q  [15],  where  several 
iterations  of  the  equation 


Qi<  =  Qi<- 1 


d^WTLs/d(2 

^2^wtls/^2 


are  performed  every  time  the  data  vectors  x  and  y  are  updated 
with  new  data.  The  numerator  of  this  update  equation  is  the 
“Jacobian”  of  the  original  metric  function,  and  is  computed  as  Eq. 
(8).  The  denominator  of  this  update  equation  is  the  “Hessian”  of 
the  original  metric  function,  which  can  be  found  to  be 


The  TLS  approach  assumes  that  there  is  error  on  both  the  and 
j/j  measurements,  as  depicted  in  Fig.  1(b),  and  models  the  data  as 
(y-  Ay)  =  Q(x-  Ax).  (The  error  bars  on  the  data  point  are  meant 
to  illustrate  the  uncertainties  in  each  dimension,  which  are  pro¬ 
portional  to  gX[  and  Gyr )  We  assume  that  Ax  comprises  zero-mean 
Gaussian  random  variables,  with  known  variances  g$  and  that  Ay 
comprises  zero-mean  Gaussian  random  variables,  with  known  vari¬ 
ances  Gy. ,  where  g%.  is  not  necessarily  equal  to  or  related  to  g $ . 

TLS  attempts  to  find  an  estimate  Q.  of  the  true  cell  total  capacity 
Q.  that  minimizes  the  sum  of  squared  errors  Axz  plus  the  sum  of 
squared  errors  Ay*.  We  generalize  that  approach  here  slightly  to 
allow  for  finding  a  Q.  that  minimizes  the  sum  of  weighted  squared 
errors,  where  the  weighting  takes  into  account  the  uncertainty  of 
the  measurement.  That  is,  we  desire  to  find  a  Q.  that  minimizes  the 
weighted  total  least  squares  (WTLS)  merit  function 
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^WTLS 


=E 


(Xi-xo2  ,  (y/-Y-) 
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yi 


(6) 


In  this  equation,  X,  and  Y[  are  the  points  on  the  line  Y,  =  QX,  corre¬ 
sponding  to  the  noisy  measured  data  pair  (x,,y,  ).  Since  both  x,  andy, 
have  noise,  we  must  handle  this  optimization  problem  differently 
from  the  way  we  handled  the  WLS  problem  of  Eq.  (3).  We  use  the 
approach  of  reference  [6],  where  Lagrange  multipliers  are  used 
to  augment  the  merit  function  with  the  constraint  that  Y,  =  QX,. 


d2x2 
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(9) 


The  Newton-Raphson  search  can  be  initialized  with  a  WLS  esti¬ 
mate  of  Q,  and  has  the  property  that  the  number  of  significant 
figures  in  the  solution  doubles  with  each  iteration  of  the  update. 
In  practice,  we  find  that  around  four  iterations  produce  double¬ 
precision  results.  Note  that  the  metric  function  Xwtls  convex» 
so  this  iterative  method  is  guaranteed  to  converge  to  the  global 
solution  [7]. 

2.  There  is  no  recursive  update  in  the  general  case.  This  has  storage 
implications  and  computational  implications.  To  use  WTLS,  the 
entire  vector  x  and  y  must  be  stored,  which  implies  increasing 
storage  as  the  number  of  measurements  increase.  Furthermore, 
the  number  of  computations  grows  as  N  grows.  This  is  not  well 
suited  for  an  embedded-system  application  that  must  run  in  real 
time  with  limited  storage  capabilities. 

3.  There  is  no  fading  memory  recursive  update  (because  there  is  no 
recursive  update).  A  non-recursive  fading  memory  merit  function 
may  be  defined,  however,  as 
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The  Jacobian  of  this  merit  function  is 

9Xfmwtls  =  3y ^yN-i  ^x'  Z  +  X|'°*i  'l 

tt  (Q2ct2+CT2)2 

The  Hessian  is 
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A  Newton-Raphson  search  may  be  used  with  this  fading- 
memory  cost  function  to  find  an  estimate  of  Q. 


In  Section  4,  we  will  address  a  special  case  of  WTLS  that  gives  a 
closed-form  solution,  with  recursive  update,  and  fading  memory. 
In  Section  5,  we  will  give  an  approximate  solution  to  the  general 
WTLS  problem  that  also  has  these  nice  properties.  Before  we  do  so, 
we  first  consider  two  important  properties  of  both  the  WLS  and 
WTLS  solutions. 


3.3.  Evaluating  the  goodness  of  the  model  fit 


When  the  measurement  errors  Ax  and  Ay  are  uncorrelated  and 
Gaussian,  the  metric  functions  x^LS  and  Xwtls  are  chi-squared  ran¬ 
dom  variables.  x^LS  is  a  chi-squared  random  variable  with  N-  1 
degrees  of  freedom,  because  N  data  points  yf  were  used  in  its  cre¬ 
ation  and  one  degree  of  freedom  is  lost  when  fitting  Q.  Xwris  is 
a  chi-squared  random  variable  with  2N-1  degrees  of  freedom, 
because  N  data  points  x*  and  N  additional  data  points  y,  are  used 
in  its  creation,  and  one  degree  of  freedom  is  lost  when  fitting  Q. 
Knowledge  of  the  distribution  and  the  number  of  degrees  of  free¬ 
dom  can  be  used  to  determine,  from  the  optimized  values  of  the 
metric  functions,  whether  the  model  fit  is  reliable;  that  is,  whether 
the  linear  fit  is  a  good  fit  to  the  data,  and  whether  the  optimized 
value  of  Q  is  a  good  estimate  of  the  cell  total  capacity. 

The  incomplete  gamma  function  P(x2  |  v)  is  defined  as  the  prob¬ 
ability  that  the  observed  chi-square  for  a  correct  model  should 
be  less  than  a  value  x2  for  degree  of  freedom  v.  Its  complement, 
Q(X2  |  v)  =  1 -P(X2  |  y),  is  the  probability  that  the  observed  chi- 
square  will  exceed  the  value  x2  by  chance  even  for  a  correct  model.1 
Therefore,  to  test  for  goodness  of  fit  of  a  model,  we  must  evaluate 


Q(x2M 


1 

WJ2) 


e  ft^y/2  ^dt. 


Methods  for  computing  this  function  are  built  into  many  engineer¬ 
ing  analysis  programs,  and  c-language  code  may  be  found  in  [15]. 
If  the  value  obtained  for  Qix2  I  v)  is  small,  then  either  the  model  is 
wrong  and  can  be  statistically  rejected,  or  the  variances  cr2.  or  o'2  are 
poorly  known,  or  the  variances  are  not  actually  Gaussian.  The  third 
possibility  is  fairly  common,  but  also  generally  benign  if  we  are  con¬ 
tent  to  accept  low  values  of  Qix2  I  v)  as  representing  a  valid  model 
[15].  It  is  not  uncommon  to  accept  models  with  Qix2  |  v)>  0.001  and 
to  reject  them  otherwise.  We  will  see  that  when  the  hypothesized 
model  is  not  a  good  fit  to  the  data,  the  value  of  Qix2  |  v)  becomes 
extremely  small.  However,  when  the  hypothesized  model  is  equal 
to  the  true  model  generating  the  data,  even  when  Q.  is  not  precisely 
equal  to  Q  the  value  of  Qix2  |  v)  tends  to  be  very  close  to  unity.  We 
will  use  this  information  later  to  show  that  the  WLS  model  is  not  a 
good  approach  to  total  capacity  estimation,  whereas  WTLS  is  much 
better. 


1  Note  that  the  nomenclature  Q(x2|v)  is  standard  for  the  (complementary) 
incomplete  gamma  function,  and  is  not  to  be  confused  with  the  symbol  used  to 
denote  true  cell  total  capacity  Q,  or  with  the  symbol  used  to  denote  the  estimate  of 
cell  total  capacity  Q. 


3.4.  Evaluating  the  confidence  limits  on  the  estimated  total 
capacity  Q 


When  computing  an  estimate  of  cell  total  capacity  Q,  it  is  also 
important  to  be  able  to  specify  the  certainty  of  that  estimate. 


-  6Qx,yi  +yf) 


(12) 


Specifically,  we  would  like  to  estimate  the  variance  a?  of  the  total 

capacity  estimate,  with  which  we  can  compute  confidence  intervals 
such  as  three-sigma  bounds  (Q  -  3 cr^,  Q  +  3a^)  within  which  the 
true  value  of  cell  total  capacity  Q  lies,  with  high  certainty. 

To  derive  confidence  limits,  we  must  re-cast  the  least-squares 
type  optimization  problem  as  a  maximum-likelihood  optimization 
problem.  With  the  assumption  that  all  errors  are  Gaussian,  this 
is  straightforward.  If  we  form  a  vector  y  comprising  elements  yif 
and  a  vector  x  comprising  corresponding  elements  xz  and  a  diag¬ 
onal  matrix  having  corresponding  diagonal  elements  a2.,  then 

minimizing  Xwls  1S  equivalent  to  maximizing 

MtwLS  =  (2xAw/2  ^(-^(y-Qx/s^Cy-Qx)) 

=  (2^/2|i;y|./2eXP(“^WLs)’ 

which  is  a  maximum  likelihood  problem.  (The  constant  to  the  left  of 
the  exponential  causes  the  function  to  integrate  to  1 ,  yielding  a  valid 
probability  density  function.)  Similarly,  if  we  form  a  vector  d  con- 

A 

catenating  y  and  x,  and  a  vector  d  concatenating  the  corresponding 
elements  Yj  and  X,,  and  a  diagonal  matrix  Zd  having  diagonal  ele¬ 
ments  cr2.  followed  by  cr2 ,  then  minimizing  Xwris  1S  equivalent  to 
maximizing 


^  =  (2jt)w^£.|1/2  gXP  (~)(<1  ~  ~ 

■  “p  H*™15)' 

The  maximum-likelihood  formulation  makes  it  possible  to  deter¬ 
mine  confidence  intervals  on  Q.  According  to  the  Cramer-Rao 
theorem,  the  variance  of  Q  (more  precisely,  the  lower  bound  to 
the  variance)  is  given  by  the  negative  inverse  of  the  second  deriva¬ 
tive  of  the  argument  of  the  exponential  function,  evaluated  at  the 
Q.  that  minimizes  the  least-squares  cost  function  or  maximizes  the 
maximum-likelihood  cost  function  [4].  Therefore, 

a?  >  2  (  9  X,WLS^  for  WLS 

V  3Q2  / 

or?  >  2  ( (LXms  for  WTLS. 

Q"  V  9Q2  J 

The  second  partial  derivatives  of  the  WTLS  and  FMWTLS  metric 
functions  were  already  derived  in  Eqs.  (9)  and  (12).  For  WLS  and 
FMWLS,  we  have 


and 
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which  may  be  computed  using  the  previously  defined  recursive 
parameters  as 


92Xwls 
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4.  Simplified  method  with  proportional  confidence  on 
and  yi 


from  the  solution  of  the  quadratic  equation,  which  corresponds  to 
the  positive  root.  Recursive  calculation  is  done  via 


The  general  WTLS  solution  from  Section  3.2  provides  excellent 
results  but  is  impractical  to  implement  in  an  embedded  system. 
Therefore,  we  search  for  cases  that  lead  to  simpler  implementa¬ 
tions.  Here,  we  look  at  an  exact  solution  when  the  uncertainties 
on  the  X[  and  data  points  are  proportional  to  each  other  for  all 
i,  which  leads  to  a  simple  solution  that  can  easily  be  implemented 
in  an  embedded  system.  With  insights  from  this  solution  we  will 
next  look  at  an  approximate  WTLS  solution  in  Section  5  that  also 
has  nice  implementation  properties. 

If  oX[  =  koyi}  then  the  WTLS  merit  function  of  Eq.  (6)  reduces  to 
a  generalization  of  the  standard  TLS  merit  function 

„2  ~Xif  ,  (Ti  -  Yif  (Ti  -  Q*z)2  M  ^ 

“'tr  kl<  <  'tr<Q^+D<  ‘*  1 

Furthermore,  the  partial  derivative  of  the  WTLS  merit  function  of 
Eq.  (8)  reduces  to 


9Xtls  =  2  y  ( 92>  -  Vj  X  9Jfy± + * ) 

9(2  (Q2k2  +  D2o-y2i 
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This  equation  may  be  solved  for  an  exact  solution  to  Q,  without 
requiring  iteration  to  do  so.  We  first  collect  terms 
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where  initialization  is  done  by  setting  x0  =  1  andyo  =  Qnom.  is  set 
to  a  representative  value  of  the  uncertainty  of  the  total  capacity. 
Therefore,  c3?0  =  Q^0m/°'yi»  c2,o  =  Qnom/o^  and  c1>0  =  l/o£ ,  and 
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The  Hessian,  which  is  required  to  compute  the  uncertainty  of  the 
estimate,  may  also  be  found  in  terms  of  the  recursive  parameters: 


d2 Xjis  =  (-4 k4c2n3  +  6 k4c3Q}  +  (~6d  +  12 c2)fe2Q,  +  2(d  -  k2c3) 
3Q2  (Q2/<2  +  i)3 


This  can  be  used  to  predict  error  bounds  on  the  estimate  Q.  One- 

sigma  bounds  are  computed  as  ^2/(92x^LS/3Q.2). 

Fading  memory  may  be  easily  incorporated.  Recursive  calcula¬ 
tion  is  done  via 


Qn  = 
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We  simplify  notation  slightly  by  defining  c3?n  =  /^y,-  Then, 


where  initialization  is  done  by  setting  x0  =  1  andy0  =  Qnom-  cr$.  is  set 
to  a  representative  value  of  the  uncertainty  of  the  total  capacity. 
Therefore,  c3?0  =  Q^om/oryf.  ?2,o  =  Qnom/^y.  and  c1>0  =  1/a2  and 


Qn  = 


(Ci,„  -  k2c3i„)  ±  J{ciy„-k2c3y„)2  +4k2c2  n 
2k2C2,n 


Which  of  the  two  roots  to  choose?  We  can  show  that  this  quadratic 
equation  always  has  one  positive  root  and  one  negative  root.  This 
can  be  proven  by  forming  the  Routh  array,  and  performing  the 
Routh  test  on  its  values  [3].  The  Routh  array  is: 


Q2 

Q1 

Q() 


,n 


0 

0 


The  first  column  of  the  Routh  array  always  has  exactly  one  sign 
change,  so  there  is  one  root  of  the  polynomial  in  the  right-half  plane. 
The  other  root,  therefore,  must  be  in  the  left-half  plane.  By  the  fun¬ 
damental  theorem  of  algebra,  because  the  coefficients  c\ >n,  c2>n,  and 
c3  n  are  real,  the  polynomial  roots  must  either  both  be  real  or  be 
complex  conjugates.  The  fact  that  they  are  in  different  halves  of  the 
complex  plane  shows  that  they  cannot  be  complex  conjugates,  and 
therefore  must  both  be  real.  Therefore,  we  choose  the  largest  root 


Cl,n  = 

x\ 

C2,„  =  yc2,  „_i  +  -HjS- 

yi 

C3,n  =  Yc3,n- 1  H - y- 


After  some  straightforward  manipulations,  we  can  obtain  the  Hes¬ 
sian  in  terms  of  the  recursive  parameters  C\  through  c3: 

^FMTLS 

do2 

_  (-4k4c2)Q3  +  6 kAc3Q2  +  (— 6ci  +  12c2)fc2Q  +  2(ct  -  k2c3 ) 

(Q2k2  +  1)3 


In  summary,  this  TLS  solution  shares  the  nice  properties  of  the 
WLS  solution: 


1.  It  gives  a  closed-form  solution  for  Q.  No  iteration  or  advanced 

algorithms  are  required— only  simple  multiplication,  addition, 

and  division. 
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Fig.  2.  Geometry  of  WTLS:  (a)  mapping  between  (x;,  y,)  and  (X,,  Y,)  for  WTLS  with  unequal  confidence  on  x,  and  y,  ;  (b)  mapping  between  (x,,  y,)  and  (X,,  Y,)  for  WTLS  with 
equal  confidence  onx/  andy,  ;  and  (c)  definitions  for  derivation  of  approximate  WTLS. 


2.  The  solution  can  be  very  easily  computed  in  a  recursive  manner. 
We  keep  track  of  the  three  running  sums  ci>n,  c2>n  and  c3  n.  When 
an  additional  data  point  becomes  available,  we  update  the  sums 
and  compute  an  updated  total  capacity  estimate. 

3.  Fading  memory  is  easily  added. 

Unfortunately,  this  solution  does  not  allow  <r2  and  ex2.  to  be 
arbitrary— they  must  be  proportionally  related  by  the  scaling  factor 
oX[  =  1«Tyr  The  next  section  describes  an  approximation  to  TLS  that 
allows  an  arbitrary  relationship. 

5.  Approximate  full  solution  with  recursive  formulation 


is  0  =  tan-1  Q.  The  shortest  distance  between  the  line  and  a  given 

data  point  is  Rt  =  Ay*  cos  0  =  Ayt/ y/ 1  +  tan2  0  =  Ayjy/ 1  +  Q2. 

Let  8xi  =  RjS\n6  and  <5y,-  =  ft, cos  0.  These  are  the  x-  and  y- 
components  of  the  perpendicular  distance  between  data  point  i  and 
the  fitting  line.  We  then  weigh  our  fitting  merit  function  according 
to  these  variances.  Therefore,  we  define  the  approximate  weighted 
total  least  squares  (AWTLS)  merit  function  as 
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Note  that  sin2  0  =  1  -  cos2  0  =  Q2/{  1  +  Q2).  Therefore, 


We  desire  an  approximate  solution  to  the  WTLS  problem  that 
allows  a2  and  a2,  to  be  non-proportional,  but  which  yields  a  recur¬ 
sive  solution  for  feasible  implementation  in  an  embedded  system. 
Fig.  2  shows  the  geometry  of  WTLS  and  motivates  the  approximate 
solution  to  be  developed  in  this  section. 

Fig.  2(a)  shows  the  relationship  between  data  point  (x,-,  y,)  and 
its  optimized  map  (X,-,  Y,-)  on  the  line  Y,-  =  QX,  when  cr2  and  or2,  are 
arbitrary.  (The  error  bars  on  the  data  point  are  meant  to  illustrate 
the  uncertainties  in  each  dimension,  which  are  proportional  to  oX[ 
and  (7yr)  We  see  that  the  distance  between  x,-  and  X/  is  not  neces¬ 
sarily  equal  to  the  distance  between  y,-  and  Y,-.  Indeed,  if  we  know 
that  the  quality  of  the  x,-  measurement  is  better  (poorer)  than  the 
quality  of  they,  measurement,  the  distance  to  its  mapX,  on  the  line 
should  be  shorter  (greater)  than  the  distance  from  y,  to  its  map  Y, 
on  the  line. 

Fig.  2(b)  shows  the  relationship  between  data  point  (x,-,  y,)  and 
its  optimized  map  (X,,  Y,)  on  the  line  Y,  =  QX,  when  cr2  and  or2,  are 
equal.  In  this  case,  the  distance  between  x,  and  X,  is  equal  to  the 
distance  between  y,  and  Y,,  and  the  line  joining  data  point  (x,,  y,) 
and  (X,,  Y,)  is  perpendicular  to  the  line  Y,  =  QX,.  (If  oX[  and  oy.  are 
not  equal  but  proportional,  the  x-  ory-axis  may  be  scaled  to  yield 
transformed  data  points  with  equal  variances,  and  hence  the  same 
idea  applies.) 

Fig.  2(c)  illustrates  the  definitions  that  will  be  used  to  derive 
an  approximate  weighted  total  least  squares  (AWTLS)  solution.  As 
with  the  TLS  solution,  we  enforce  that  the  line  joining  data  point 
(x,,  y,)  and  (X,,  Y,)  be  perpendicular  to  the  line  Y,  =  QX,.  This  will 
result  in  a  solution  that  may  be  solved  recursively.  However,  as 
with  the  WTLS  solution,  we  weight  the  distance  between  x,  and  X, 
differently  from  the  distance  between  y,  and  Y,  in  the  optimization 
merit  function.  This  will  give  a  better  total  capacity  estimate  than 
TLS  when  the  uncertainties  on  x,  and  y,  are  not  proportional. 

We  define  Ax,  be  the  x-distance  between  data  point  i  and  the 
line,  and  Ay,  be  the  y-distance  between  data  point  i  and  the  line. 
The  slope  of  the  line  is  Q  =  Ay, /Ax,  for  all  i.  The  angle  of  the  line 


Q2 

1  +  Q2 

1 

1  +  Q2 


Therefore,  since  Ay,  =y,  -  Qx,, 
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(20) 


To  verify  that  AWTLS  is  an  approximation  to  WTLS  in  at  least  some 
cases,  we  note  that  the  two  merit  functions  are  equal  when  crx.  = 
oy[  However,  they  are  not  equal  when  oX[  =  koy. ,  but  this  will  be 
corrected  at  the  end  of  this  section. 

The  Jacobian  of  the  AWTLS  merit  function  is 


AWTLS 

dQ 


(21) 


This  can  be  re-written  in  terms  of  recursively  computed  running 
summations 


AWTLS 

3Q 


(Q2  + 1)' 


(C5Q4  +  (2^4  -  c\  -  0>)Q~ 


+  (3c2  -3c5)Q2  +(ct  -2c3  +  c6)Q-c2)  . 

where  initialization  is  done  by  setting  x0  =  1  and  yo  =  Qnom.  <r20  is 
set  to  a  representative  value  of  the  uncertainty  of  the  total  capacity 
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and  g2  js  se t  to  a  representative  value  of  the  uncertainty  of  a  dif¬ 
ference  between  two  SOC  estimates.  Therefore,  Ci?0  =  1  /cr*  c2, o  = 
Qnom/^yg,  03,0  —  Qnom/^yo’  ^4,0  —  1/^xq’  05,0  =  Qnom/^xo’  05,0  — 
Qnom/^xo’ 


O  ,n  —  0,n-l 
02,n  =  0?,n-l 

03, n  =  03,n-l 
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The  roots  of  this  quartic  equation 

C5Q4  +  (2 c4  -  Ci  -  c6)Q3  +  (3c2  -  3c5)Q2 
+  (ci  -2c3+c6)Q-c2  =0 


are  candidate  solutions  for  Q.  They  may  be  found  using  the  Fer¬ 
rari  method,  a  closed-form  solution  that  does  not  require  iterative 
or  eigenvalue  analysis,  so  may  be  implemented  readily  in  an 
embedded  system  [17].  However,  of  the  four  roots  only  one  is  the 
optimum,  and  we  do  not  know  any  method  to  decide  a  priori  which 
root  to  solve  for.  Indeed,  in  our  experience,  with  some  sets  of  data 
all  roots  are  real,  but  with  other  sets  of  data  some  can  be  complex, 
and  some  can  be  negative.  The  only  foolproof  method  we  know 
to  determine  which  root  is  the  solution  that  minimizes  X^wtls  is 
to  evaluate  the  merit  function  Xawtls  at  each  of  the  four  candidate 
solutions,  and  to  retain  the  one  that  gives  the  lowest  value.  (This  can 
be  skipped  for  negative  and  complex  roots,  which  are  not  possible 
solutions  for  battery  cell  capacity.)  Computing  the  merit  function 
may  be  very  readily  done  if  we  rewrite  it  in  terms  of  the  running 
summations 

xiwTLS  =  - - - 2  (C4^4  “  2C5^3  +  (C1  +  C6)Q2  “  2C2Q.  +  C3)  . 

(Q2  +  1) 


When  the  assumptions  made  in  deriving  AWTLS  are  approximately 
true,  the  Hessian  yields  a  good  value  for  the  error  bounds  on 
the  total  capacity  estimate.  After  some  straightforward  but  messy 
mathematics,  we  can  find  the  Hessian  to  be 


d2x 


2 

AWTLS 


- — — j(-2c5Q5  +  (3d  -  6 c4  +  3c6)Q4 

(Q2  +  l) 

+  (-12c2  +  16c5)Q3  +  (-8ci  +  10c3  +  6c4  -  8c6)Q.2 
+(12c2  -6c5)Q  +  (c!  -2c3  +  c6)).  (22) 


Fading  memory  can  be  easily  incorporated.  The  cost  function  is 
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The  Jacobian  is 
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This  can  be  re-written  in  terms  of  recursively  computed  running 
summations 
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where  initialization  is  done  by  setting  x0  =  1  and  y0  =  Qnom.  cr2o  is 
set  to  a  representative  value  of  the  uncertainty  of  the  total  capacity 
and  cr20  is  set  to  a  representative  value  of  the  uncertainty  of  a  dif¬ 
ference  between  two  SOC  estimates.  Therefore,  Ci?0  =  1  /g2q,c2,o  = 
Qnom/oy  C3  0  Qnom /^yn*  ^4,0  —  1 /0*n *  05,0  —  Qnom/Oxn’  05,0  — 
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Again,  this  quartic  equation  may  be  solved  using  the  Ferrari 
method,  for  example.  The  four  candidate  solutions  must  be  checked 
against  the  merit  function  to  determine  which  one  is  optimal.  The 
merit  function  in  terms  of  these  variables  is 
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The  Hessian  is 
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(c4Q4  -  2 C5Q3  +  (Cl  +  c6)Q2  -  2c2Q  +  c3)  . 
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4  (— 2?5  Q5  +  (3ci  —  6c4  +  3cg)Q4 

(Q2  +  l) 

+  (— 12c2  +  16c5)Q3  +  (— 8ci  +  IOC3  +  6c4  —  8cg)Q.2 
+  (12c2  —  605)0,  +  (ci  —  2C3  +  Cg)).  (25) 


Note  that  the  AWTLS  merit  function  in  Eq.  (20)  does  not  equal 
the  WTLS  merit  function  in  Eq.  (7)  when  Gy[  =  kGX[.  This  can  be 
easily  remedied.  Define  yz-  =  kyz.  Then  Gy  =  gx..  Invoke  the  AWTLS 

or  FMAWTLS  methods  to  find  total  capacity  estimate  Q.  and  Hes¬ 
sian  H  using  input  sequences  comprised  of  the  original  x  vector 
and  the  scaled  y  vector  (i.e.,  (xz,yz)  with  corresponding  variances 
(cf2  ,  k2G '2  )).  The  true  slope  estimate  can  be  found  as  Qcorrected  = 

Q/k  and  the  corrected  Hessian  can  be  found  as  Hcorrected  =H//<2.  This 
is  the  method  used  in  the  results  section,  where  the  proportion¬ 
ality  constant  is  estimated  as  k  =  gX]  /oryi .  This  scaling  improves 
results  even  when  ay.  and  gX[  are  not  proportionally  related,  if  k  is 
chosen  to  give  an  “order  of  magnitude”  proportionality  or  average 
proportionality  between  the  uncertainties  ofxz  andyz. 

In  summary,  these  AWTLS  solutions  share  the  nice  properties  of 
the  WLS  solution: 
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Fig.  3.  Results  from  hybrid  electric  vehicle  application  simulation  scenario  1. 


1.  They  give  a  closed-form  solution  for  Q.  No  iteration  or  advanced 
algorithms  are  required— only  simple  multiplication,  addition, 
and  division. 

2.  The  solution  can  be  very  easily  computed  in  a  recursive  manner. 
We  keep  track  of  the  six  running  sums  C\  n  through  c6jl.  When 
an  additional  data  point  becomes  available,  we  update  the  sums 
and  compute  an  updated  total  capacity  estimate. 

3.  Fading  memory  can  easily  be  added  to  allow  the  estimate  Q  to 
place  greater  emphasis  on  more  recent  measurements  than  on 
earlier  measurements,  allowing  adaptation  of  Q  to  adjust  for  true 
cell  total  capacity  changes. 

4.  Furthermore,  this  method  is  superior  to  the  TLS  solution  since  it 
allows  individual  weighting  on  the  xz  andyz  data  points. 

6.  Simulations  to  test  the  methods 

This  section  presents  a  number  of  usage  scenarios  to  exercise 
the  four  different  total  capacity  estimation  methods  and  compare 
their  performance.  All  scenarios  use  the  fading-memory  version  of 
the  four  methods,  but  we  omit  the  prefix  “FM”  for  brevity.  Unless 
otherwise  mentioned,  the  fading  memory  forgetting  factor  y  =  1.0. 

We  assume  that  the  individual  SOC  estimates  that  are  input  to 
these  methods  can  be  determined  to  an  accuracy  of  az  =  0.01 .  This  is 
being  very  generous,  since  the  best  method  we  are  aware  of,  SPKF, 
achieves  only  around  az  =  0.01  for  LMO  cells  and  <rz  =  0.03  for  LFP 
cells  in  practice,  when  Qnom  is  used  instead  of  Q  in  the  estimator. 
Other  methods  we  have  used,  such  as  extended  Kalman  filtering 
or  EKF  [10-12],  achieve  around  oz  =  0.02  or  higher  for  LMO  cells  in 
practice.  (A  nice  advantage  of  both  EKF  and  SPKF  is  that  they  give 
dynamic  estimates  of  oz  that  ensure  that  the  values  of  ax.  used  in 
total  capacity  estimation  are  accurate.)  However,  despite  limita¬ 
tions  in  present  SOC  estimation,  we  are  confident  that  with  better 
cell  modeling,  these  methods  can  be  improved  in  the  future. 

We  use  computer  simulation  rather  than  cell  testing  to  validate 
the  algorithms  because  it  allows  us  to  constrain  a  variety  of  factors 
that  would  be  difficult  to  control  in  a  real-time  embedded  sys¬ 
tem.  These  include:  the  efficacy  and  accuracy  of  the  SOC  estimation 
algorithms  used  to  provide  input  to  the  total  capacity  estimation 
algorithms;  the  accuracy  and  precision  of  the  raw  sensor  mea¬ 
surements  used  as  input  (including  the  challenges  of  bias  errors, 
nonlinear  errors,  and  random  errors,  for  example);  the  repeatabil¬ 
ity  of  the  experiment;  and  the  fact  that  total  capacity  of  a  physical 
cell  fades  over  time  and  the  associated  difficulty/impossibility  of 
knowing  the  “true”  value  of  total  capacity  with  which  to  compare 
our  results.  Therefore,  we  choose  to  use  synthetic  data  to  isolate 
the  performance  of  the  total  capacity  estimation  algorithms  them¬ 


selves,  when  all  other  factors  are  in  some  sense  idealized.  The  xz  and 
yz  values  are  mathematically  generated,  as  described  in  the  individ¬ 
ual  subsections  below.  We  are  currently  collecting  long-term  life 
test  data  from  both  LMO  and  LFP  cells,  which  we  intend  to  use  in 
future  publications  to  evaluate  the  effectiveness  of  these  and  other 
algorithms  over  the  lifetime  of  a  cell. 

6.1.  Hybrid  electric  vehicle  application,  scenario  1 

The  first  sets  of  simulations  that  we  present  are  for  hybrid 
electric  vehicle  scenarios.  From  the  perspective  of  total  capacity 
estimation,  these  applications  are  characterized  by  the  narrow  win¬ 
dow  of  SOC  used.  We  assume  that  the  vehicle  uses  a  SOC  range  of 
40-60%.  Therefore,  each  time  the  total  capacity  estimate  is  updated, 
the  true  change  in  SOC  can  range  from  -0.2  to  +0.2.  We  simulate 
this  by  choosing  the  true  value  ofxz  to  be  a  uniform  random  number 
selected  between  these  limits. 

The  HEV  application  is  also  characterized  by  the  fact  that  the  bat¬ 
tery  pack  is  never  fully  charged  to  a  precisely  known  SOC;  therefore, 
each  time  the  total  capacity  estimate  is  updated,  two  estimates  of 
SOC  are  required  to  compute  x  =  z(t2)-z(ti).  This  gives  an  overall 
cr2  =  2 cr2  =  2(0.01  )2.  We  simulate  this  by  computing  the  “mea¬ 
sured”  value  of  xz  to  be  equal  to  the  true  value  of  xz  added  to  a 
zero-mean  Gaussian  random  number  having  variance  <r2. 

We  compute  the  true  value  of  yz  to  be  equal  to  the  nomi¬ 
nal  capacity  of  the  cell  Qnom  multiplied  by  the  true  value  of  xz. 
Noise  on  the  yz  measurement  is  assumed  to  comprise  accumu¬ 
lated  quantization  noises  (for  other  noise  factors,  see  section  7). 
For  yz  computed  by  summing  mz  measurements,  taken  at  a  1  Hz 
rate,  from  a  sensor  having  quantizer  resolution  q,  the  total  noise 
is  <r2.  =  q2mz/(  12  x  36002).  For  HEV  scenario  1,  we  assumed  that 
the  maximum  range  of  the  current  sensor  is  ±30QnOm  and  that  a 
10  bit  A2D  is  used  to  measure  current.  This  leads  to  q  =  60/1024. 
We  chose  mz  =  300  s  for  every  measurement  and  a  nominal  capacity 
of  Qnom  =  10  Ah.  The  recursive  estimates  were  not  initialized  prior 
to  the  first  data  point  being  received  (equivalently,  the  recursive 
parameters  were  initialized  to  zero). 

Results  from  this  scenario  are  presented  in  Fig.  3.  The  left  frame 
shows  estimates  made  using  the  four  methods  evolving  over  time 
as  thick  lines,  and  their  three-sigma  error  bounds,  computed  using 
the  Hessian  method,  as  thin  lines.  We  see  that  WTLS,  TLS,  and 
AWTLS  give  identical  estimates  and  error  bounds  under  this  sce¬ 
nario,  and  converge  to  the  neighborhood  of  the  true  total  capacity. 
The  WLS  estimate  is  biased,  and  the  error  bounds  are  (incorrectly) 
so  tight  that  they  are  indistinguishable  from  the  estimate  itself. 
The  right  frame  shows  the  goodness  of  fit  metric  as  applied  to  the 
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HEV  Scenario  2:  Goodness  of  fit  for  each  method 
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Fig.  4.  Results  from  hybrid  electric  vehicle  application  simulation  scenario  2. 
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four  methods.  Again,  WTLS,  TLS,  and  AWTLS  give  identical  results, 
quickly  converging  to  a  value  of  1.0  (i.e.,  the  methods  are  confident 
that  their  estimate  is  reliable).  The  WLS  method  returns  a  vanish¬ 
ingly  small  value  for  goodness  of  fit,  reflecting  the  fact  that  the 
method  does  not  give  a  good  value  for  its  total  capacity  estimate 
and/or  bounds. 

6.2.  Hybrid  electric  vehicle  application ,  scenario  2 

The  second  HEV  scenario  is  identical  to  the  first,  except  that 
the  recursive  methods  are  initialized  with  a  total  capacity  estimate 
before  further  measurements  are  received.  In  this  case,  the  methods 
were  initialized  with  a  “nominal”  capacity  estimate  of  9.9  Ah  (the 
true  total  capacity  was  still  10.0).  Results  are  presented  in  Fig.  4. 
In  this  scenario,  TLS  and  AWTLS  give  identical  results  for  both  their 
estimates,  error  bounds,  and  goodness  of  fit.  WTLS  cannot  be  calcu¬ 
lated  recursively,  so  the  estimation  cannot  be  initialized— its  results 
are  the  same  as  for  scenario  1.  Once  again,  WLS  is  inferior  to  the 
other  methods.  TLS  and  AWTLS  give  the  best  results  because  of 
tighter  error  bounds  while  maintaining  a  high  value  for  goodness 
of  fit. 

63.  Hybrid  electric  vehicle  application ,  scenario  3 

In  the  third  HEV  scenario,  we  explore  the  ability  of  the  algo¬ 
rithms  to  track  a  moving  value  of  total  capacity.  This  scenario  is 
identical  to  HEV  scenario  2,  except  that  the  true  total  capacity  is 


changing  with  a  slope  of  -0.001  Ah  per  measurement  update,  and 
a  fading  memory  forgetting  factor  of  y  =  0.99  is  used  for  all  meth¬ 
ods.  Results  are  presented  in  Fig.  5,  where  the  true  total  capacity 
is  drawn  as  a  dotted  black  line.  In  this  example,  the  WLS  method 
appears  to  give  good  results,  but  its  goodness  of  fit  value  is  still 
vanishingly  small  because  the  error  bounds  are  unreasonably  tight, 
and  almost  never  surround  the  true  value  of  total  capacity.  WTLS, 
TLS,  and  AWTLS  are  also  able  to  track  the  moving  value  of  total 
capacity— TLS  and  AWTLS  give  the  best  results  due  to  the  ability 
to  initialize  with  a  reasonable  initial  value,  yielding  narrower  error 
bounds. 

6.4.  Battery  electric  vehicle  application,  scenario  l 

The  next  scenarios  that  we  consider  are  typical  of  battery  electric 
vehicle  and  plug-in  hybrid  electric  vehicle  operation.  These  are  dif¬ 
ferent  from  HEV  application  in  several  respects:  the  battery  total 
capacity  is  larger,  the  relative  rate  of  energy  usage  is  lower,  the 
range  of  SOC  used  by  the  vehicle  is  larger,  and  the  BEV  battery  pack 
is  sometimes  fully  charged  to  a  known  set  point.  In  all  cases,  we 
consider  a  battery  pack  with  total  capacity  of  Qnom  =  100  Ah,  and  a 
maximum  rate  of  ±5Qnom-  We  again  assume  a  1 0-bit  current  sensor, 
which  gives  q  =  10/1024  and  <t2.  =  q2mj/(12  x  36002). 

For  the  first  BEV  scenario  we  assume  that  the  total  capacity  esti¬ 
mate  is  updated  on  a  regular  basis  as  the  vehicle  operates,  with 
mz  =  7200  s.  We  assume  that  the  battery  SOC  can  change  by  ±40%  in 
that  interval,  so  the  true  value  of  xt  is  chosen  to  be  a  uniform  ran- 
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Fig.  5.  Results  from  hybrid  electric  vehicle  application  simulation  scenario  3. 
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Fig.  6.  Results  from  battery  electric  vehicle  application  simulation  scenario  1 
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dom  variable  between  -0.4  and  +0.4.  Again,  noise  on  xz  is  Gaussian 
with  variance  o 7  =  2(0.01  )2.  The  recursive  methods  are  initialized 
with  an  initial  total  capacity  estimate  of  99  Ah. 

Representative  results  of  this  scenario  are  presented  in  Fig.  6. 
These  are  very  similar  in  most  respects  to  the  HEV  scenario  2  results. 
Again,  WLS  fails  because  its  error  bounds  are  far  too  tight,  leading 
to  a  vanishingly  small  goodness  of  fit.  WTLS,  TLS,  and  AWTLS  all 
give  good  results,  with  TLS  and  AWTLS  giving  the  best  results  due 
to  lower  error  bounds  because  of  the  possibility  of  initialization. 

6.5.  Battery  electric  vehicle  application ,  scenario  2 

The  asymptotic  quality  of  the  total  capacity  estimates  is  lim¬ 
ited  by  the  noise  on  the  SOC  estimation  error.  If  this  noise  can 
be  reduced,  the  total  capacity  estimates  can  become  much  more 
accurate.  The  BEV  application  allows  a  means  to  do  this:  whenever 
the  battery  pack  is  fully  charged,  we  have  a  precisely  known  end¬ 
point  SOC.  Therefore,  either  z(U)  or  z(t2)  can  be  known  “exactly” 
for  every  total  capacity  estimate  update.  This  then  allows  us  to  use 

o$l=o?={  0.01  )2. 

The  tradeoff  is  that  we  no  longer  have  regular  updates.  Updates 
happen  randomly,  whenever  the  vehicle  is  charged.  Therefore,  mz 
becomes  a  random  variable.  For  this  work,  we  treat  mz  as  a  log¬ 
normal  random  variable  with  mode  0.5  h  and  standard  deviation 
0.6  h.  This  gives  the  probability  density  function  of  drive  cycle  dura¬ 
tions  that  is  shown  in  Fig.  7.  (Other  PDFs  could  easily  be  used: 


this  one  was  chosen  to  give  reasonable  duration  drive  cycles  that 
encompassed  a  variety  of  driving  behaviors  and  distances.)  Also, 
since  a  greater  fraction  of  the  battery  pack  would  be  used  for  an 
entire  drive  cycle  than  for  a  regular  periodic  update,  we  use  an  80% 
range  of  SOC,  so  the  true  value  of  xz  is  computed  to  be  a  uniform 
random  number  from  -0.8  to  +0.8. 


EV  Scenario  2:  Capacity  Estimates  with  Error  Bounds 

— ■ — i — ■ — i — * — i — ■ — i — • — i — ■ — i — * — i — ■ — i  ■  i  i 

- WLS 

WTLS 

- -  TLS 

- AWTLS 


—  100 
0 

cc 

E 

99.5 

co 

0 


98.5 


CO 

CO 

0 

c 

~o 

o 

o 

CD 


EV  Scenario  2:  Goodness  of  fit  for  each  method 


0.8 


0.7 


0.6 

0.5 

0.4 


WLS 

WTLS 

TLS 

AWTLS 


0.3 


0.2 

0.1 

0 


0  100  200  300  400  500  600  700  800 


900  1000 


Algorithm  update  index 


Algorithm  update  index 


Fig.  8.  Results  from  battery  electric  vehicle  application  simulation  scenario  2. 
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Fig.  9.  Results  from  battery  electric  vehicle  application  simulation  scenario  3. 


Results  from  this  scenario  are  presented  in  Fig.  8.  WLS  fails  once 
again,  but  this  time  TLS  also  fails.  TLS  fails  because  gX[  koy[  due 
to  the  variable-length  drive  cycles.  The  estimate  given  by  TLS  is 
actually  quite  reasonable,  but  the  goodness  of  fit  is  very  small.  WTLS 
gives  good  results,  and  AWTLS  gives  the  best  results  due  to  lower 
error  bounds  because  of  the  ability  to  initialize  the  estimate. 

Note  that  the  asymptotic  three-sigma  error  bounds  drop  from 
about  ±1%  to  about  ±0.15%  of  the  total  capacity  due  to  having  a 
lower  value  of  oX[  and  also  due  to  the  wider  range  in  Xj. 

6.6.  Battery  electric  vehicle  application,  scenario  3 

The  final  scenario  we  consider  is  identical  to  BEV  scenario  2, 
except  that  we  simulate  a  changing  total  capacity.  The  slope  of 
the  total  capacity  curve  is  chosen  to  be  -0.01  Ah  per  measure¬ 
ment  update,  and  y  =  0.98  was  used.  Representative  results  of  this 
scenario  are  presented  in  Fig.  9.  Once  again,  WLS  fails  and  TLS  is 
uncertain  of  its  estimate  for  nearly  1 00  updates.  However,  TLS  does 
recover  and  do  quite  well.  The  AWTLS  method  gives  the  best  results. 

7.  Discussion 

The  simulation  results  have  illustrated  a  few  key  properties  of 
the  four  methods  we  have  discussed  to  estimate  total  capacity: 

•  Noise  on  the  SOC  estimates  must  be  considered  in  order  to  prop¬ 
erly  estimate  battery  total  capacity.  Least  squares,  weighted  least 
squares,  and  other  similar  methods  simply  fail.  They  give  biased 
estimates  of  total  capacity,  with  unreliable  error  bounds.  Methods 
related  to  total  least  squares,  where  noises  on  the  SOC  estimates 
are  explicitly  recognized  and  incorporated  in  the  calculations,  are 
required  for  reliable  total  capacity  estimation. 

•  WTLS,  in  principle  always  gives  the  best  results.  However,  we 
have  seen  that  in  practice  that  the  TLS  and  AWTLS  methods  can 
give  better  results  because  they  can  be  initialized  with  a  nominal 
capacity  estimate.  Furthermore,  since  TLS  and  AWTLS  give  nice 
recursive  solutions,  one  of  them  should  always  be  used  instead 
of  WTLS. 

•  If  the  measurement  update  interval  mz  is  constant,  and  therefore 
oX[  =  kcry.  for  all  measurements,  TLS  and  AWTLS  give  identical 
results.  Therefore,  the  simpler  TLS  method  is  preferred.  However, 
if  gx.  fz  kGyi,  the  AWTLS  method  gives  better  results  than  TLS,  and 
sometimes  TLS  fails.  This  is  particularly  important  for  the  BEV 
application  where  total  capacity  estimate  updates  are  done  when 
charging  the  battery,  to  yield  a  greatly  improved  total  capacity 
estimate  because  of  the  reduction  in  gX[  due  to  knowing  one  SOC 


value  exactly.  AWTLS  always  gives  results  at  least  as  good  as  the 
other  methods. 

•  The  noises  that  contribute  to  Gy[  are  assumed  to  be  due  to  cur¬ 
rent  sensor  errors.  In  practice,  these  can  include  gain  errors,  bias 
errors,  noise  errors,  and  nonlinear  errors.  We  have  considered 
only  the  noise  errors  here.  Gain  errors  and  nonlinear  errors  will 
bias  all  of  the  methods;  however,  we  believe  that  the  biased  value 
of  the  total  capacity  estimate  will  be  consistent  with  the  per¬ 
ceived  capacity  of  the  battery  pack  if  the  same  current  sensor  is 
used  to  compute  the  battery  pack  total  capacity  estimate  and  to 
monitor  pack  operations.  Bias  error  can  be  subtracted  in  a  BEV  set¬ 
ting  if  we  can  assume  that  the  Coulombic  efficiency  of  the  cells  is 
rj^  1  by  matching  the  discharged  ampere  hours  from  usage  with 
the  charged  ampere  hours. 

•  The  output  error  bounds  on  the  total  capacity  estimate,  even  with 
the  optimum  WTLS  estimator,  are  larger  than  some  might  expect. 
This  underscores  the  need  for  a  method  that  predicts  not  only  the 
estimate,  but  also  dynamic  error  bounds  on  the  estimate,  as  do 
the  ones  proposed  in  this  article.  Without  dynamic  error  bounds, 
the  user  of  the  total  capacity  estimate  has  no  idea  how  good  or  bad 
that  estimate  is.  If  the  estimate  is  used  to  compute  battery  pack 
available  energy,  for  example,  the  energy  estimate  may  be  overly 
optimistic  or  overly  pessimistic,  neither  of  which  is  acceptable. 

8.  Conclusions 

In  this  paper,  we  have  investigated  several  methods  to  estimate 
battery  cell  total  capacity.  We  have  shown  that  standard  meth¬ 
ods,  which  consider  noises  on  only  the  accumulated  ampere  hours 
and  not  on  the  SOC  estimates,  are  biased  and  give  incorrect  error 
bounds  on  their  estimates.  Weighted  total  least  squares  considers 
noise  on  both  the  accumulated  ampere  hours  and  on  the  SOC  esti¬ 
mates  used  to  estimate  total  capacity,  and  hence  the  total  capacity 
estimates  are  far  more  robust.  Weighted  total  least  squares,  how¬ 
ever,  does  not  allow  a  recursive  solution,  so  is  inappropriate  for  an 
embedded  systems  implementation.  Standard  total  least  squares 
can  be  implemented  recursively,  and  under  some  scenarios,  such  as 
the  HEV  application,  gives  results  that  are  as  accurate  as  weighted 
total  least  squares.  However,  when  the  noises  on  the  accumulated 
ampere  hours  and  on  the  SOC  estimates  are  not  proportional,  total 
least  squares  fails.  To  address  this  shortcoming,  we  have  introduced 
an  approximate  weighted  total  least  squares  method  that  is  recur¬ 
sive,  and  gives  results  that  are  indistinguishable  from  those  from 
weighted  total  least  squares  in  all  settings  and  are  at  least  as  good 
as  all  methods  considered.  Furthermore,  estimate  error  bounds  and 
a  goodness-of-fit  metric  may  be  readily  computed. 
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